2 research outputs found

    Extracting relevant predictive variables for COVID-19 severity prognosis: An exhaustive comparison of feature selection techniques

    Get PDF
    With the COVID-19 pandemic having caused unprecedented numbers of infections and deaths, large research efforts have been undertaken to increase our understanding of the disease and the factors which determine diverse clinical evolutions. Here we focused on a fully data-driven exploration regarding which factors (clinical or otherwise) were most informative for SARS-CoV-2 pneumonia severity prediction via machine learning (ML). In particular, feature selection techniques (FS), designed to reduce the dimensionality of data, allowed us to characterize which of our variables were the most useful for ML prognosis. We conducted a multi-centre clinical study, enrolling n=1548 patients hospitalized due to SARS-CoV-2 pneumonia: where 792, 238, and 598 patients experienced low, medium and high-severity evolutions, respectively. Up to 106 patient-specific clinical variables were collected at admission, although 14 of them had to be discarded for containing ⩾60% missing values. Alongside 7 socioeconomic attributes and 32 exposures to air pollution (chronic and acute), these became d=148 features after variable encoding. We addressed this ordinal classification problem both as a ML classification and regression task. Two imputation techniques for missing data were explored, along with a total of 166 unique FS algorithm configurations: 46 filters, 100 wrappers and 20 embeddeds. Of these, 21 setups achieved satisfactory bootstrap stability (⩾0.70) with reasonable computation times: 16 filters, 2 wrappers, and 3 embeddeds. The subsets of features selected by each technique showed modest Jaccard similarities across them. However, they consistently pointed out the importance of certain explanatory variables. Namely: patient’s C-reactive protein (CRP), pneumonia severity index (PSI), respiratory rate (RR) and oxygen levels –saturation SpO2, quotients SpO2/RR and arterial SatO2/FiO2 –, the neutrophil-to-lymphocyte ratio (NLR) –to certain extent, also neutrophil and lymphocyte counts separately–, lactate dehydrogenase (LDH), and procalcitonin (PCT) levels in blood. A remarkable agreement has been found a posteriori between our strategy and independent clinical research works investigating risk factors for COVID-19 severity. Hence, these findings stress the suitability of this type of fully data-driven approaches for knowledge extraction, as a complementary to clinical perspectives

    Impact of outdoor air pollution on severity and mortality in COVID-19 pneumonia

    Get PDF
    The relationship between exposure to air pollution and the severity of coronavirus disease 2019 (COVID-19) pneumonia and other outcomes is poorly understood. Beyond age and comorbidity, risk factors for adverse outcomes including death have been poorly studied. The main objective of our study was to examine the relationship between exposure to outdoor air pollution and the risk of death in patients with COVID-19 pneumonia using individual-level data. The secondary objective was to investigate the impact of air pollutants on gas exchange and systemic inflammation in this disease. This cohort study included 1548 patients hospitalised for COVID-19 pneumonia between February and May 2020 in one of four hospitals. Local agencies supplied daily data on environmental air pollutants (PM10PM_{10}, PM2.5PM_{2.5}, O3O_3, NO2NO_2, NONO and NOXNO_X) and meteorological conditions (temperature and humidity) in the year before hospital admission (from January 2019 to December 2019). Daily exposure to pollution and meteorological conditions by individual postcode of residence was estimated using geospatial Bayesian generalised additive models. The influence of air pollution on pneumonia severity was studied using generalised additive models which included: age, sex, Charlson comorbidity index, hospital, average income, air temperature and humidity, and exposure to each pollutant. Additionally, generalised additive models were generated for exploring the effect of air pollution on C-reactive protein (CRP) level and SpO2O_2/FiO2O_2 at admission. According to our results, both risk of COVID-19 death and CRP level increased significantly with median exposure to PM10PM_{10}, NO2NO_2, NONO and NOXNO_X, while higher exposure to NO2NO_2, NONO and NOXNO_X was associated with lower SpO2O_2/FiO2O_2 ratios. In conclusion, after controlling for socioeconomic, demographic and health-related variables, we found evidence of a significant positive relationship between air pollution and mortality in patients hospitalised for COVID-19 pneumonia. Additionally, inflammation (CRP) and gas exchange (SpO2O_2/FiO2O_2) in these patients were significantly related to exposure to air pollution
    corecore